Sequential decision making in repeated coalition formation under uncertainty

نویسندگان

  • Georgios Chalkiadakis
  • Craig Boutilier
چکیده

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learning framework is developed for this problem when coalitions are formed (and tasks undertaken) repeatedly: not only does the model allow agents to refine their beliefs about the types of others, but uses value of information to define optimal exploration policies. However, computational approximations in that work are purely myopic. We present novel, non-myopic learning algorithms to approximate the optimal Bayesian solution, providing tractable means to ensure good sequential performance. We evaluate our algorithms in a variety of settings, and show that one, in particular, exhibits consistently good sequential performance. Further, it enables the Bayesian agents to transfer acquired knowledge among different dynamic tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian Approach to Multiagent Reinforcement Learning

A Bayesian Approach to Multiagent Reinforcement Learning and Coalition Formation under Uncertainty Georgios Chalkiadakis Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2007 Sequential decision making under uncertainty is always a challenge for autonomous agents populating a multiagent environment, since their behaviour is inevitably influenced by the behaviou...

متن کامل

Coalition analysis with preference uncertainty in group decision support

Coalition analysis is extended to incorporate uncertain preference into three stability concepts, general metarationality (GMR), symmetric metarationality (SMR), and sequential stability (SEQ) under the paradigm of the graph model for conflict resolution. As a follow-up analysis in the graph model, coalition analysis aims to assess whether equilibriums under individual calculations are vulnerab...

متن کامل

Convergence in a sequential two stages decision making process

We analyze a sequential decision making process, in which at each stepthe decision is made in two stages. In the rst stage a partially optimalaction is chosen, which allows the decision maker to learn how to improveit under the new environment. We show how inertia (cost of changing)may lead the process to converge to a routine where no further changesare made. We illustrate our scheme with some...

متن کامل

Utilizing Decision Making Methods and Optimization Techniques to Develop a Model for International Facility Location Problem under Uncertainty

Abstract The purpose of this study is to consider an international facility location problem under uncertainty and present an integrated model for strategic and operational planning. The paper offers two methodologies for the location selection decision. First the extended VIKOR method for decision making problem with interval numbers is presented as a methodology for strategic evaluation of po...

متن کامل

Efficient Methods for Near-Optimal Sequential Decision Making under Uncertainty

This chapter discusses decision making under uncertainty. More specifically, it offers an overview of efficient Bayesian and distribution-free algorithms for making near-optimal sequential decisions under uncertainty about the environment. Due to the uncertainty, such algorithms must not only learn from their interaction with the environment but also perform as well as possible while learning i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008